Incorporating Concreteness in Multi-Modal Language Models with Curriculum Learning
نویسندگان
چکیده
Over the last few years, there has been an increase in studies that consider experiential (visual) information by building multi-modal language models and representations. It is shown several acquisition humans starts with learning concrete concepts through images then continues abstract ideas text. In this work, curriculum method used to teach model concrete/abstract their corresponding captions accomplish modeling/representation. We use BERT Resnet-152 on each modality combine them using attentive pooling perform pre-training newly constructed dataset, which collected from Wikimedia Commons based words. To show performance of proposed model, downstream tasks ablation are performed. The contribution work two-fold: A new dataset words, a approach proposed. results contributes success model.
منابع مشابه
Incorporating active learning into a traditional curriculum.
During the past three academic years, "self-learning exercises" (SLEs) have been incorporated into the Medical Physiology course for first-year students at the Morehouse School of Medicine. Roughly 20-30% of the material covered in the course is presented to the students in the form of these exercises, instead of in lectures. The exercises are intended to help the students develop skills in act...
متن کاملLearning Multi-modal Similarity
In many applications involving multi-media data, the definition of similarity between items is integral to several key tasks, e.g., nearest-neighbor retrieval, classification, and recommendation. Data in such regimes typically exhibits multiple modalities, such as acoustic and visual content of video. Integrating such heterogeneous data to form a holistic similarity space is therefore a key cha...
متن کاملIncorporating E-learning in teaching English language to medical students: exploring its potential contributions
Background: The spread of technology has influenced different aspects of human life, and teaching and learning are not exceptions. This study aimed to examine the potential contribution of the use of technology in teaching English language to medical students. Methods: This qualitative-action research study was conducted in Birjand University of Medical Sciences (BUMS), with 60 medica...
متن کاملExploiting Multi-modal Curriculum in Noisy Web Data for Large-scale Concept Learning
Learning video concept detectors automatically from the big but noisy web data with no additional manual annotations is a novel but challenging area in the multimedia and the machine learning community. A considerable amount of videos on the web are associated with rich but noisy contextual information, such as the title, which provides weak annotations or labels about the video content. To lev...
متن کاملLearning Multi-modal Control Programs
Multi-modal control is a commonly used design tool for breaking up complex control tasks into sequences of simpler tasks. In this paper, we show that by viewing the control space as a set of such tokenized instructions rather than as real-valued signals, reinforcement learning becomes applicable to continuous-time control systems. In fact, we show how a combination of state-space exploration an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2021
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app11178241